-4-ji/u^\ 
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1 


<110> 


APPLICANT: McCarthy, Sean A 






















2 


<120> 


TITLE OF INVENTION: 


NOVEL CRSP 


PROTEIN 


AND 


NUCLEIC 


ACTD 


MOLECULES 


AND 


3 




THEREFOR 




























4 


<130> 


FILE REFERENCE: 


ME I 


-008 
























5 


<140> 


CURRENT 


APPLICATION 


NUMBER: 


US/09/009, 802B 














6 


<141> 


CURRENT 


FILING 


DATE 


: 1998-01-20 




















7 


<150> 


EARLIER 


APPLICATION 


NUMBER: 


08/842, 898 








E 


EN 


ITE 


■ D 


8 


<151> 


EARLIER 


FILING 


DATE 


: 1997-04-17 


















. n 


9 


<150> 


EARLIER 


APPLICATION 


NUMBER: 


60/071, 589 
















10 


<151> 


EARLIER 


FILING 


DATE 


: 1998-01-15 




















11 


<160> 


NUMBER OF SEQ ID NOS : 19 






















12 


<170> 


SOFTWARE: Patentln Ver. 


2.0 






















13 


<210> 


SEQ 


ID NO 1 




























14 


<211> 


LENGTH : 


2479 




























15 


<212> 


TYPE: DNA 




























16 


<213> 


ORGANISM: Homo 


sapiens 
























17 


<220> 


FEATURE 






























18 


<221> 


NAME /KEY: CDS 




























19 


<222> 


LOCATION: (38) . 


. (1087) 
























20 


<220> 


FEATURE 






























21 


<223> 


OTHER INFORMATION: 


n' at position 


1146 


may 


be 


any 


nucleotide 




22 


<400> 


SEQUENCE : 1 




























23 




ggcacgaggg ggcggcggct gcgggcgcag agcggag atg cag egg ctt ggg gec 


55 


24 






















Met Gin Arg Leu Gly Ala 




25 
























1 








5 




26 




acc 


ctg 


ctg tgc 


ctg 


ctg 


ctg 


gcg 


gcg 


gcg 


gtc 


ccc 


acg 


gee 


ccc 


gcg 


103 


27 




Thr 


Leu 


Leu Cys 


Leu 


Leu 


Leu 


Ala 


Ala 


Ala 


Val 


Pro 


Thr 


Ala 


Pro 


Ala 




28 








10 










15 










20 








29 




ccc 


get 


ccg acg 


gcg 


acc 


teg 


get 


cca 


gtc 


aag 


ccc 


ggc 


ccg 


get 


etc 


151 


30 




Pro 


Ala 


Pro Thr 


Ala 


Thr 


Ser 


Ala 


Pro 


Val 


Lys 


Pro 


Gly 


Pro 


Ala 


Leu 




31 








25 








30 










35 










32 




age 


tac 


ccg cag 


gag 


gag 


gee 


acc 


etc 


aat 


gag 


atg 


ttc 


cgc 


gag 


gtt 


199 


33 




Ser 


Tyr 


Pro Gin 


Glu 


Glu 


Ala 


Thr 


Leu 


Asn 


Glu 


Met 


Phe 


Arg 


Glu 


Val 




34 






40 








45 










50 












35 




gag 


gaa 


ctg atg 


gag 


gac 


acg 


cag 


cac 


aaa 


ttg 


cgc 


age 


gcg 


gtg 


gaa 


247 


36 




Glu 


Glu 


Leu Met 


Glu 


Asp 


Thr 


Gin 


His 


Lys 


Leu 


Arg 


Ser 


Ala 


val 


Glu 




37 




55 








60 










65 










70 




38 




gag 


atg 


gag gca 


gaa 


gaa 


get 


get 


get 


aaa 


gca 


tea 


tea 


gaa 


gtg 


aac 


295 


39 




Glu 


Met 


Glu Ala 


Glu 


Glu 


Ala 


Ala 


Ala 


Lys 


Ala 


Ser 


Ser 


Glu 


Val 


Asn 




40 










75 










80 










85 






41 




ctg 


gca 


aac tta 


cct 


ccc 


age 


tat 


cac 


aat 


gag 


acc 


aac 


aca 


gac 


acg 


343 


42 




Leu 


Ala 


Asn Leu 


Pro 


Pro 


Ser 


Tyr 


His 


Asn 


Glu 


Thr 


Asn 


Thr 


Asp 


Thr 




43 








90 










95 










100 








44 




aac 


gtt 


gga aat 


aat 


acc 


ate 


cat 


gtg 


cac 


cga 


gaa 


att 


cac 


aag 


ata 


391 
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45 


Asn 


Val 


Gly Asn Asn 


Thr 


He 


His 


Val 


His 


Arcr 


Glu 


He 


His 


Lvs 


He 




46 






105 










110 










115 










47 


acc 


aac 


aac 


cag 


act 


gga 


caa 


atq 


crtc 


ttt 


tea 


cracr 

3 3 


aca 


Qtt 


ate 


aca 


439 


48 


Thr 


Asn 


Asn 


Gin 


Thr 


Gly Gin 


Met 


Val 


Phe 


Ser 


Glu 


Thr 


Val 


He 


Thr 




49 




120 










12 5 










130 












50 


tct 


gtg 


gga 


gac 


gaa 


gaa 


ggc 


aga 


aqq 


acre 


cac 


craa 

3 ~J 


tac 


ate 


ate 


aac 


487 


51 


Ser 


val 


Gly Asp 


Glu 


Glu Gly 


Arg 


Arg 


Ser 


His 


Glu 


Cys 


He 


He 


Asp 




52 


135 










140 










145 










150 




53 


craa 
3 j 


crac 


tgt 


ggg 


ccc 


age 


atg 


tac 


tgc 


cag 


ttt 


gcc 


age 


ttc 


cag 


tac 


535 


54 


Glu 


Asp 


Cys 


Gly 


Pro 


Ser 


Met 


Tyr 


Cys 


Gin 


Phe 


Ala 


Ser 


Phe 


Gin 


Tvr 




55 










155 










160 










165 






56 


acc 


tqc 


cag 


cca 


tgc 


egg 


ggc 


cag 


acrcr 

Z) 3 


atg 


etc 


tgc 


acc 


caa 


gac 


aat 


583 


57 


Thr 


Cys 


Gin 


Pro 


Cys 


Arg 


Gly 


Gin 


Arg 


Met 


Leu 


Cvs 


Thr 


Arg 


Asp 


Ser 




58 








170 










175 










180 








59 


aaa 

3 3 


tcrc 


tgt 


gga 


gac 


cag 


ctg 


tgt 


gtc 


tcrcr 


crcrt 

3 3 u 


cac 


tgc 


acc 


aaa 


atg 


631 


60 


Glu 


Cys 


Cys 


Gly Asp Gin Leu 


Cys 


Val 


Trp 


Gly 


His 


Cvs 


Thr 


Lvs 


Met 




61 






185 










190 










195 










62 


gcc 


acc 


agg 


ggc 


age 


aat 


ggg 


acc 


ate 


tat 


gac 


aac 


cag 


aaa 


gac 


tgc 


679 


63 


Ala 


Thr 


Arg Gly Ser Asn Gly 


Thr 


He 


Cvs 


Asd 


Asn 


Gin 


Ara 


Asp 


Cys 




64 




200 










205 










210 












65 


caa 


ccg 


ggg 


ctg 


tgc 


tgt 


gcc 


ttc 


cag 


aga 


crcrc 


ctg 


ctg 


ttc 


cct 


at a 


72 7 


66 


Gin 


Pro 


Gly Leu 


Cys 


Cys 


Ala 


Phe 


Gin 


Arcr 


Glv 


Leu 


Leu 


Phe 


Pro 


Val 




67 


215 










220 










225 










230 




68 


tac 

»-*3 , - 


aca 


ccc 


ctg 


ccc 


gtg 


gag 


ggc 


gag 


ctt 


tgc 


cat 


gac 


ccc 


acc 


aac 


775 


69 


Cys 


Thr 


Pro 


Leu 


Pro 


Val 


Glu 


Gly Glu 


Leu 


Cvs 


His 


Asp 


Pro 


Ala 


Ser 




70 










235 










240 










245 






71 




ctt 


ctg 


gac 


etc 


ate 


acc 


tgg 


gag 


eta 


cracr 

3 ^3 


cct 


gat 


aaa 


gcc 


ttg 


823 


72 


Arg 


Leu 


Leu 


Asp 


Leu 


He 


Thr 


Trp 


Glu 


Leu 


Glu 


Pro 


Asd 


Glv 


Ala 


Leu 




73 








250 










255 










260 








74 


qac 


caa 


tgc 


cct 


tgt 


gee 


agt 


ggc 


etc 


etc 


tgc 


cag 


CCC 


cac 


age 


cac 


871 


75 


Asp 


Arg 


Cys 


Pro 


Cys 


Ala 


Ser 


Gly Leu 


Leu 


Cvs 


Gin 


Pro 


His 


Ser 


His 




76 






265 










270 










275 










77 


acre 


ctcr 


gtg 


tat 


gtg 


tgc 


aag 


ccg 


acc 


ttc 


crtcr 

3 *-3 


aaa 

33 3 


age 


cat 


gac 


caa 


919 


78 


Ser 


Leu 


Val 


Tyr 


Val 


Cys 


Lys 


Pro 


Thr 


Phe 


Val 


Gly 


Ser 


Arg 


Asp 


Gin 




79 




280 










285 










290 












80 


gat 


crcjcr 


gag 


ate 


ctg 


ctg 


ccc 


aga 


gag 


gtc 


ccc 


gat 


gag 


tat 


gaa 


gtt 


967 


81 


Asp 


Gly 


Glu 


He 


Leu 


Leu 


Pro 


Arg 


Glu 


Val 


Pro 


Asp 


Glu 


Tyr 


Glu 


Val 




82 


295 










300 










305 










310 




83 


ggc 


age 


ttc 


atg 


gag 


gag 


gtg 


cgc 


cag 


gag 


ctg 


gag 


gac 


ctg 


gag 


agg 


1015 


84 


Gly 


Ser 


Phe 


Met 


Glu 


Glu 


Val 


Arg 


Gin 


Glu 


Leu 


Glu 


Asp 


Leu 


Glu 


Arg 




85 










315 










320 










325 






86 


age 


ctg 


act 


gaa 


gag 


atg 


gcg 


ctg 


agg 


gag 


cct 


gcg 


get 


gcc 


gcc 


get 


1063 


87 


Ser 


Leu 


Thr 


Glu 


Glu 


Met 


Ala 


Leu 


Arg 


Glu 


Pro 


Ala 


Ala 


Ala 


Ala 


Ala 




88 








330 










335 










340 








89 


gca 


ctg 


ctg 


gga 


agg 


gaa 


gag 


att 


tagatctgga ccaggctgtg ggtagatgtg 


1117 


90 


Ala 


Leu 


Leu Gly Arg Glu Glu 


He 




















91 






345 










350 




















92 


caatagaaat agctaattta tttccccang 


_t gtg tgc ttt 


aagcgtgggc tgaccaggct 


1177 


93 


tctfccctaca tcttcttccc agtaagtttc 


ccctctggct 


tgacagcatg aggtgttgtg 


1237 


94 


catttgttca gctcccccag gctgttctcc 


aggcttcaca 


gtctggtgct tgggagagtc 


1297 
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95 aggcagggtt aaactgcagg agcagtttgc cacccctgtc cagattattg gctgctttgc 1357 

96 ctctaccagt tggcagacag ccgtttgttc tacatggctt tgataattgt ttgaggggag 1417 

97 gagatggaaa caatgtggag tctccctctg attggttttg gggaaatgtg gagaagagtg 1477 

98 ccctgctttg caaacatcaa cctggcaaaa atgcaacaaa tgaattttcc acgcagttct 1537 

99 ttccatgggc ataggtaagc tgtgccttca gctgttgcag atgaaatgtt ctgttcaccc 1597 

100 tgcattacat gtgtttattc atccagcagt gttgctcagc tcctacctct gtgccagggc 1657 

101 agcattttca tatccaagat caattccctc tctcagcaca gcctggggag ggggtcattg 1717 

102 ttctcctcgt ccatcaggga tttcagaggc tcagagactg caagctgctt gcccaagtca 1777 

103 cacagctagt gaagaccaga gcagtttcat ctggttgtga ctctaagctc agtgctctct 1837 

104 ccactacccc acaccagcct tggtgccacc aaaagtgctc cccaaaagga aggagaatgg 18 97 

105 gatttttctt ttgaggcatg cacatctgga attaaggtca aactaattct cacatccctc 1957 

106 taaaagtaaa ctactgttag gaacagcagt gttctcacag tgtggggcag ccgtccttct 2017 

107 aatgaagaca atgatattga cactgtccct ctttggcagt tgcattagta actttgaaag 2 077 

108 gtatatgact gagcgtagca tacaggttaa cctgcagaaa cagtacttag gtaattgtag 213 7 

109 ggcgaggatt ataaatgaaa tttgcaaaat cacttagcag caactgaaga caattatcaa 2197 
HO ccacgtggag aaaatcaaac cgagcagggc tgtgtgaaac atggttgtaa tatgcgactg 2257 
Hi cgaacactga actctacgcc actccacaaa tgatgttttc aggtgtcatg gactgttgcc 2317 

112 accatgtatt catccagagt tcttaaagtt taaagttgca catgattgta taagcatgct 2377 

113 ttctttgagt tttaaattat gtataaacat aagttgcatt tagaaatcaa gcataaatca 2437 

114 cttcaactgc taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2479 

115 <210> SEQ ID NO 2 

116 <211> LENGTH: 350 

117 <212> TYPE: PRT 

118 <213> ORGANISM: Homo sapiens 

119 <400> SEQUENCE : 2 

120 Met Gin Arg Leu Gly Ala Thr Leu Leu Cys Leu Leu Leu Ala Ala Ala 

121 15 10 15 

122 Val Pro Thr Ala Pro Ala Pro Ala Pro Thr Ala Thr Ser Ala Pro Val 

123 20 25 30 

12 4 Lys Pro Gly Pro Ala Leu Ser Tyr Pro Gin Glu Glu Ala Thr Leu Asn 

125 35 40 45 

126 Glu Met Phe Arg Glu Val Glu Glu Leu Met Glu Asp Thr Gin His Lys 

127 50 55 60 

128 Leu Arg Ser Ala Val Glu Glu Met Glu Ala Glu Glu Ala Ala Ala Lys 

129 65 70 75 80 

130 Ala Ser Ser Glu Val Asn Leu Ala Asn Leu Pro Pro Ser Tyr His Asn 

131 85 90 95 

132 Glu Thr Asn Thr Asp Thr Asn Val Gly Asn Asn Thr He His Val His 

133 100 105 HO 

134 Arg Glu He His Lys He Thr Asn Asn Gin Thr Gly Gin Met Val Phe 

135 115 120 125 

13 6 Ser Glu Thr Val He Thr Ser Val Gly Asp Glu Glu Gly Arg Arg Ser 

137 130 135 140 

138 His Glu Cys He He Asp Glu Asp Cys Gly Pro Ser Met Tyr Cys Gin 

139 145 150 155 160 

14 0 Phe Ala Ser Phe Gin Tyr Thr Cys Gin Pro Cys Arg Gly Gin Arg Met 

141 165 170 175 

142 Leu Cys Thr Arg Asp Ser Glu Cys Cys Gly Asp Gin Leu Cys Val Trp 

143 180 185 190 

144 Gly His Cys Thr Lys Met Ala Thr Arg Gly Ser Asn Gly Thr He Cys 
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145 
146 
147 
148 



AS p Asn Gin Arg Asp Cys Gin Pro Gly Leu Cys Cys Ala Phe Gin Arg 
Oly Leu Leu Phe Pro Val Cys Thr Pro Leu Pro III Glu Gly Glu Leu 



200 



205 



^s His Asp Pro Ala Ser Arg Leu Leu Asp Leu He Thr Trp Glu Leu 
HI Glu Pro Asp Gly HI Leu Asp Arg Cys Pro Cys Ala Ser Gly Leu Leu 

HI cys Gin Pro h" Ser His Ser Leu Val Tyr Val Cys Lys Pro Thr Phe 

HI val Gly HI Arg Asp Gin Asp Gly Glu He Leu Leu Pro Arg Glu Val 

Pro Asp Glu Tyr Glu val oly Ser Phe Met Glu Glu Val Arg Gin Glu 



310 



Leu Glu Asp Leu Glu Arg Ser Leu Thr Glu Glu Met Ala Leu Arg Glu 
325 330 



16X 

162 Pro Ala A±a ju* ^ — ~ --- - - 35Q 



Pro Ala Ala Ala Ala Ala Ala Leu Leu Gly Arg Glu Glu lie 



340 



163 

164 <210> SEQ ID NO 3 

165 <211> LENGTH: 1050 

166 <212> TYPE: DNA 

167 <213> ORGANISM: Homo sapiens 

168 <22 0> FEATURE: 

169 <221> NAME /KEY : CDS 

170 <222> LOCATION: (l)..(1050) 

17 , <40 0> SEQUENCER fcgc cfcg cfcg 48 

172 atg !f g : g9 .... „Z ki» T . m i Leu Cvs Leu Leu Leu Ala Ala Ala 



22 5 = 5 K = = = S = = = = s 

™ ri s s s ri s s e s s s s 2 £ £ s 96 



25 



30 



^1 aaa ccc aae ell get etc age tac ccg cag gag gag gec acc etc aat 144 

Lys Pro Gly Pro Ala Leu Ser Tyr Pro Gin Glu Glu Ala Thr Leu Asn 

i s s sH z = s = = = = = i = = = = 192 

£ 2 £ = = S s £ £ giu S E S E S K S 240 



gca tea tea gaa gtg ale ctg gea aae tta cot eee age tat cac aat 2SS 
HI Ala Ser Ser Glu Val Asn Leu Ala Asn Leu Pro Pro Ser Tyr His 



i 8 9 0 gag aee aae aea gac aeg aae gtt gga aat aat acc ate eat gtg cac 336 

HI 111 Thr Asn Thr Asp Thr Asn Val Gly Asn Asn Thr He His Val His 

105 110 



S S S S = S S S = = = = - = 2 S SS 391 
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120 125 
]ll tea qaq aca gtt ate aca tct gtg gga gac gaa gaa ggc aga agg age 432 

Ser Glu Tar Val He Thr Ser Val Gly Asp Glu Glu Gly Arg Arg Ser 

cae gag tge ate ate gae gag gae tgt ggg eee age atg tac tgc cag 480 
His Glu Cys He lie Asp Glu Asp Cys Gly Pro Ser Met Tyr Cys Gin 



199 
200 



227 



201 III mc acre ttc eag tac ace tgc cag cca tgc egg ggc cag agg atg 528 

HI Te Tla ser III Gin Tyr Thr Cys Gin Pro Cys Arg Gly Gin Arg Met 

i feu S = Arg £ £ = 5 S E = = = £ ^ S " 

5 S = - s £ 2 K S S 5 = £ i S S cf s 624 

gae aac eag agg gae tge eag ecg ggg ctg tgc tgt gee ttc eag aga 672 

212 Asp Asn Gin Arg Asp Cys Gin Pro Gly Leu Cys Cys Ala Phe Gin Arg 

215 220 

213 mr III eta ttc cct gtg tgc aca ccc ctg ccc gtg gag ggc gag ctt 720 

214 ggc ctg ctg ttc ccc gey uy ^ ^ ^ ^ ^ 

215 



% 2 2 £ ™ ^ 5s * »» - «. «i «« «» £ 



230 235 



- £ Mt g ,e « jc. , 9 = « c« «. ,~ «c f = „= £ „ ct. «. 

2i8 Cys His Asp Pro A±a sei my ^« ™p 

245 250 

5 = £ S Z 2 2 2 S 5 2 S 2 2 5 2 2 

265 270 



003 tgc eag eec cae age cae age ctg gtg tat gtg tge aag ecg ace ttc 864 

cys Gin Pro His Ser His Ser Leu Val Tyr val Cys Lys Pro Thr Phe 



224 Cys Gin fro nis - OQC - 

275 280 
„, qt a agg age cgt gac caa gat ggg gag ate ctg ctg ccc 

va! Gly Ser Arg Asp Gin Asp Gly Glu He Leu Leu Pro Arg Glu Val 



225 275 „ a t- ™a aaa ate ctg ctg ccc aga gag gtc 912 



295 300 



| - S s « = s s s 2 2 = = s s = 2 960 

| S s 2 2 2 5 = 2 £ 2 2 2 2 2 5 2 1005 

325 3 3 0 

«t cct gcg get gee gee get gca ctg ctg gga agg gaa gag att ^50 

HI Pro III Ala Ala La Ala Ala Leu Leu Gly Arg Glu Glu lie 

o 4 n 345 " u 

237 340 

238 <210> SEQ ID NO 4 

239 <211> LENGTH : 848 

240 <212> TYPE: DNA 

241 <213> ORGANISM: Homo sapiens 

242 <220> FEATURE: 

243 <221> NAME /KEY : CDS 
<222> LOCATION: (125).. (796) 

?"l W S /or Xaa have been detected in the Sequence Listing. Please review the 
SeL^ ensure that a corresponding explanation is presented in the <220> to 

<223> fields of each sequence which presents at least one n or Xaa. 



J 
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PATENT APPLICATION US/09/009 , 802B TIME: 16:31:13 



Input Set: I009802B.RAW 



Line ? Error/Warning 



92 W "N" or "Xaa" used: Feature required 
448 w "N" or "Xaa" used: Feature required 
593 W Invalid/Missing Amino Acid Numbering 



Original Text 



caatagaaat agctaattta tttccccang tgtgtgct 
accccatttn attctagagt cnagaacgca aggatctc 



